Improved dependability for dynamically reconfigurable hardware: Restoration of the reliability index via replication and error correction
نویسندگان
چکیده
Fault-tolerant (FT) architectures based on classic spatial and temporal redundancy are used in an increasing number of applications. However, the hardware platforms underlying modern highreliability systems have little resemblance to those that were common when such architectures were devised. The earlier fault models are not necessarily out-of-date (e.g. stuck-at faults still play an important role for validating FT applications), but the new failure modes of nanometer technologies were largely irrelevant when J. von Neumann’s paper on the synthesis of reliable organisms from unreliable components was published in the 1950s. Such concerns are particularly relevant when designing high-reliability adaptive systems, where reconfigurable field-programmable gate arrays (FPGAs) are increasingly used. On the other hand, the economics of FT architectures based on spatial redundancy (e.g. triple modular redundancy, TMR), are entirely different when evaluated under the assumption of such features as dynamic reconfiguration, which enables just-in-time implementation of only those resources that need to be available at any given time, or self-reconfiguration, which enables self-contained corrective actions that are able to isolate / replace defective resources. New design approaches are 2 Improved dependability for dynamically reconfigurable hardware therefore required to cope with the challenges introduced by each new generation of programmable hardware devices. This paper presents an approach to design high-reliability architectures at lower cost, by taking advantage of dynamic / self reconfiguration and built-in test infrastructures, which are present in modern generations of FPGAs. Improved dependability for dynamically reconfigurable hardware 3
منابع مشابه
Improving the dependability of dynamically reconfigurable hardware by concurrent replication of active resources
This presentation describes a low-level technique to replicate active resources (i.e. resources that are being used by functions that are currently running) in dynamically reconfigurable FPGAs, with the main objective of releasing them to be tested in a non-intrusive way. This technique may be used to support i) Online concurrent testing to detect any faults that emerge during system operation,...
متن کاملReconfiguration Strategy for FPGA Dependability Characteristics Improvement based on Stochastic Petri Net
This paper shows the impact of the trade-off between reconfigurable and non-reconfigurable parts of the FPGA to the dependability characteristics of the whole design. Stochastic Petri nets have been used to compute reliability and dependability characteristics in a simple FPGA design with dynamically reconfigurable modules. Some parts of the design are not possible or proper to reconfigure dyna...
متن کاملOn The Design of A Dynamically Reconfigurable Function-Unit for Error Detection and Correction
This paper presents the design of a function-specific dynamically reconfigurable architecture for error detection and error correction. The function-unit is integrated in a pipelined 32 bit RISC processor and provides full hardware support for encoding and decoding of ReedSolomon Codes with different code lengths as well as error detection methods like bit-parallel Cyclic Redundancy Check codes...
متن کاملA Novel Method for Soft Error Mitigation in FPGA using Adaptive Cross Parity Code
Field Programmable Gate Arrays (FPGAs) are more prone to be affected by transient faults in presence of radiation and other environmental hazards compared to Application Specific Integrated Circuits (ASICs). Hence, error mitigation and recovery techniques are absolutely necessary to protect the FPGA hardware from soft errors arising due to such transient faults. In this paper, a new efficient m...
متن کاملClasses of Byzantine Fault-tolerant Algorithms for Dependable Distributed Systems
1 1.1. Dependable computer systems 1 1.1.1. Dependability attributes 2 1.1.2. The impairments to dependability 3 1.1.2.1. Faults, errors, and failures 3 1.1.2.2. Fault and failure classification 3 1.1.3. Dependability measures 5 1.1.4. The means of dependability 7 1.1.5. Overview 11 1.2. Fault detection techniques 11 1.2.1. Duplication 11 1.2.2. Error-detecting codes 12 1.2.3. Checksums 12 1.2....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013